RESEARCH ARTICLE 



Epidemic Klebsiella pneumoniae ST258 Is a Hybrid Strain 

Liang Chen, 3 Barun Mathema, ab Johann D. D. Pitout, c d e Frank R. DeLeo,' Barry N. Kreiswirth 3 

Public Health Research Institute Center, New Jersey Medical School, Rutgers University, Newark, New Jersey, USA a ; Department of Epidemiology, Mailman School of 
Public Health, Columbia University, New York, New York, USA b ; Division of Microbiology, Calgary Laboratory Services, Calgary, Alberta, Canada c ; Departments of Pathology 
and Laboratory Medicine, d Department of Microbiology, Immunology, and Infectious Diseases," University of Calgary, Calgary, Alberta, Canada; Laboratory of Human 
Bacterial Pathogenesis, Rocky Mountain Laboratories, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Hamilton, Montana, USA f 

ABSTRACT Carbapenem-resistant Enterobacteriaceae (CRE), especially Klebsiella pneumoniae carbapenemase (KPC)-producing 
K. pneumoniae, pose an urgent threat in health facilities in the United States and worldwide. K. pneumoniae isolates classified as 
sequence type 258 (ST258) by multilocus sequence typing are largely responsible for the global spread of KPC. A recent compara- 
tive genome study revealed that ST258 K. pneumoniae strains are two distinct genetic clades; however, the molecular origin of 
ST258 largely remains unknown, and our understanding of the evolution of the two genetic clades is incomplete. Here we com- 
pared the genetic structures and single-nudeotide polymorphism (SNP) distributions in the core genomes of strains from two 
ST258 clades and other STs (ST11, ST442, and ST42). We identified an ~1.1-Mbp region on ST258 genomes that is homogeneous 
to that of ST442, while the rest of the ST258 genome resembles that of ST11. Our results suggest ST258 is a hybrid clone — 80% of 
the genome originated from STll-like strains and 20% from ST442-like strains. Meanwhile, we sequenced an ST42 strain that 
carries the same K-antigen-encoding capsule polysaccharide biosynthesis gene (cps) region as ST258 clade I strains. Comparison 
of the cps-harboring regions between the ST42 and ST258 strains (clades I and II) suggests the ST258 clade I strains evolved from 
a clade II strain as a result of cps region replacement. Our findings unravel the molecular evolution history of ST258 strains, an 
important first step toward the development of diagnostic, therapeutic, and vaccine strategies to combat infections caused by 
multidrug-resistant K. pneumoniae. 

IMPORTANCE Recombination events and replacement of chromosomal regions have been documented in various bacteria, and 
these events have given rise to successful pathogenic clones. Here we used comparative genomic analyses to discover that the 
ST258 K. pneumoniae genome is a hybrid — 80% of the chromosome is homologous to ST1 1 strains, while the remaining 20% is 
homologous to that of ST442. Meanwhile, a recent study indicated that ST258 strains can be segregated into two ST258 clades, 
with distinct capsule polysaccharide gene (cps) regions. Our analysis suggests ST258 clade I strains evolved from clade II through 
homologous recombination of cps region. Horizontal transfer of the cps region appears to be a key element driving the molecular 
diversification in K. pneumoniae strains. These findings not only extend our understanding of the molecular evolution of ST258 
but are an important step toward the development of effective control and treatment strategies for multidrug-resistant K. pneu- 
moniae. 
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lebsiella pneumoniae carbapenemase (KPC) has emerged as a se- 
f \ rious clinical challenge in health care facilities in the United States 
and worldwide ( 1 ) . The Wa^c-harboring plasmid encoding the car- 
bapenemase has been found in numerous K. pneumoniae sequence 
types (STs)/clones as well as in other Gram-negative species; how- 
ever, the vast majority of the global KPC-producing K. pneumoniae 
isolates are associated with a single multilocus sequence type — 
ST258 (2-A). K. pneumoniae ST258 emerged as a notable clinical 
problem in the middle 2000s in the United States and remains the 
main ST in the United States and elsewhere (3-6). 

Recently, two KPC-harboring K. pneumoniae ST258 clinical 
isolates were sequenced to closure (7). These genomes were used 
as references for a comparative genome analysis of 83 ST258 clin- 



ical isolates collected between 2002 and 2012 from geographically 
diverse sources. Phylogenetic analysis of the core genome of these 
isolates revealed that ST258 K. pneumoniae strains are comprised 
of two distinct genetic clades (ST258 clades I and II), largely due to 
an ~215-kb region of divergence (RD) that includes genes in- 
volved in capsular polysaccharide (CPS) biosynthesis (7). Further 
genotyping analysis with 2 cps-associated genes, wzi and wzy, in 
ST258 and other unrelated K. pneumoniae strains identified the 
ST258 clade I genotype in genetically distinct ST42 strains (7). 
Interestingly, a GenBank BLAST search using nucleotides encom- 
passing the ST258 clade II cps region indicated this region is highly 
similar to that of a Brazilian ST442 strain, Kpl3, which harbors 
cps Kpl3 (8, 9). 
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TABLE 1 Features of completely sequenced ST258, ST1 1, and ST442 genomes 
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57.5 
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57.5 
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25 
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86 


Plasmids (n) 
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3 


6 
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6 
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8 


7 


7 
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8 


2 
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2 


2 


2 


1 


3 


1 
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22 


19 


23 


14 


23 


31 
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a ICE, integrated conjugative element; IS, insertion sequence; CDS, coding sequences. 



The emergence and global spread of ST258 and recent reports 
that this clone has diversified as a result of recombination and 
replacement of the cps region raise the question of its evolutionary 
history. One speculation is that ST11 (allelic profile 3-3-1-1-1-1- 
4), a highly predominant multidrug-resistant clone in Asia and 
South America (10-12), and a single-locus variant of ST258 (al- 
lelic profile 3-3-1-1-1-1-79) gave rise to the ST258 clone through 
the acquisition of the tonB79 allele (13). 

To better understand the phylogeny of the ST11 and ST258 
lineages, we compared the genomes of three ST 11 strains 
(HS11286, JM45, and ATCC BAA-2146), three ST258 strains 
(NJST258_1, NJST258_2, and Kpl787 [a representative ST258 
clade I strain present in our collection]), and an ST42 strain, 
Kpl832. The comparative analysis of these genomes indicates that 
large and repeated chromosomal exchanges in K. pneumoniae 
have occurred between ST1 1 and ST258, with a significant role for 
ST442 in the recent molecular evolution of epidemic ST258 
strains. 

RESULTS 

Large ~1.1-Mbp recombination region in ST258. To elucidate 
the phylogenetic relationship among ST258, ST11, and ST442 
strains, we first compared the genome sequences of six closed 
Klebsiella pneumoniae strains (Table 1). The size of the chromo- 
somes was on average -5.3 Mbp, but the number of mobile ge- 
netic elements (MGEs), including plasmids, prophages, inte- 
grated conjugative elements (ICEs), and insertion sequences (IS), 
varied (Table 1; see Fig. SI in the supplemental material). Consis- 
tent with multilocus sequence typing (MLST) (Fig. 1A), which 
indicates ST11 and ST258 differ by a single locus (the tonB allele 
distinguishes the two sequence types), the 3 ST11 and 2 ST258 



genomes have 7 of 8 prophages in common, and all harbor 
ICEKp258.1 (see Fig. SI). Sequence comparison among the tonB 
alleles shows tonB79 (in ST258) differs from tonB4 (in ST11) by 
four single-nucleotide polymorphisms (SNPs) and differs from 
tonBU (in ST442) by a single SNP. Of note, the three ST1 1 strains 
(HS1 1286, JM45, and ATCC BAA-2146) harbor three different cps 
operons, a finding similar to the distinguishing cps genotypes in 
ST258 clade I and II strains and which supports the observation 
that cps switching provides K. pneumoniae the plasticity to change 
its antigenic nature (Fig. IB). 

Comparative genome and SNP distribution analyses of the 
core chromosome region, as depicted in Fig. IB, uncovered a 
number of surprising findings given the MLST results for ST 11 
and ST258. Except for differences in the tonB allele and the region 
encoding the capsular polysaccharide biosynthetic machinery, the 
6 genomes of ST1 1 and ST258 have a high degree of identity (re- 
gions of the same color in Fig. IB). However, further analysis of 
the RD and flanking nucleotides revealed that the differences be- 
tween ST11 and ST258 were expansive, covering an ~1.1-Mbp 
contiguous region corresponding to nucleotide positions 
1,660,631 to 2,723,681 in strain N)ST258_1 (Fig. 1). Significantly, 
the ~1.1-Mbp region identified in ST258 clade I and II strains has 
identical chromosomal nucleotide boundaries (Fig. 2). 

Analysis of SNPs in the genomes of ST258 strains (NJST258_1, 
NJST258_2, and Kpl787) and ST11 strains (HS11286, JM45, and 
ATCC BAA-2146) indicated that these strains differ by an average 
of 9,647 SNPs, and 98.1% (9,460 SNPs) of these polymorphisms 
are concentrated in the contiguous -1.1 -Mbp region (identified 
above), which represents 20% of the genome (Fig. 3). By compar- 
ison, the genomes of ST258 strains and ST442 strain Kpl3 differed 
by 21,095 SNPs, consistent with their genetically distinct MLST 
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FIG 1 (A) MLST allele locations onNJST258_l genome. The light green arrow denotes the genome ofNIST258_l, and the light blue region shows the ~1.1-Mbp 
putative recombination region between the ST1 1 and ST442 genomes. The chromosomal positions of the seven MLST housekeeping genes (gapA, infB, mdh, pgi, 
phoE, rpoB, and tonB) are illustrated beneath the genome arrow of NJST258_1, and the corresponding allele numbers for ST11, ST258, ST442, and ST42 are listed 
below the gene names. (B) Core genome SNP distributions in the ST11, ST258 (cladesl and II), ST442, andST42 strains. The number of SNPs (yaxis) per 1,000 nt 
is plotted according to the position on the NJST258_1 genome (x axis). Different homogeneous regions (>98% identity based on SNP comparisons) are color 
coded. Specifically, the ~52-kb cps-containing regions in Kpl787 (ST258 clade I) and Kpl832 (ST42), which are nearly identical in these strains, are shaded in 
green. ICEKp258.2 and cps are illustrated by small vertical bars, and the same cps regions are shown in the same color. 



profiles (Fig. IB and SNP matrix in Fig. 3). Most significantly, the 
SNP mapping revealed contiguous ~1.1-Mbp regions that were 
nearly identical in ST258 clade II (NJST258_1 and NJST258_2) 
and ST442 (Kpl3) strains, differing by only 206 SNPs (1.0%). As 
depicted in Fig. IB, the comparative genomic organization and 
SNP results provide additional support to the idea that the ST258 
clade II strain is a hybrid strain containing 80% (-4.2 Mbp) of the 
chromosome from ST11 and 20% (-1.1 Mbp) from ST442 
(Fig. IB). 

The -1.1 -Mbp chromosomal region in ST258 clade I and II 
strains contains the ~215-kb RD and the cps gene cluster (Fig. 2). 
ICEKp258.2 is common to the prototype ST258 strains shown in 
Fig. IB but is absent from ST442 strain Kpl3. To determine the 
level of conservation of ICEKp258.2 among ST258 clinical iso- 
lates, we analyzed the DNA contigs of 83 additional ST258 ge- 
nomes sequenced in our previous study (7). We found that 
ICEKp258.2 is conserved in all of the queried ST258 genomes, and 
the insertion of this element in ST258 clade I and II genomes is at 
the same tRNA-Asn site (data not shown). 



cps replacement in ST258 strains. In our previous study, we 
identified seven ST42 strains that harbored cps genetic markers 
(wzy and wzi) that are identical to those in ST258 clade I strains 
(7), and we hypothesized that this unrelated sequence type (ST42) 
was the donor for the cps region in ST258 clade I strains. As a first 
step toward testing this hypothesis, we used Illumina Miseq to 
sequence the DNA in the cps region of ST42 and ST258 clade I 
strains and that in strain Kpl832, a representative ST42 isolate in 
our strain collection. The gross organization between ST42 and 
ST258 strains indicates their distal genetic relatedness (Fig. IB), a 
finding consistent with MLST data. An SNP analysis confirmed 
that there was significant genome divergence between ST258 and 
ST42 strains. A total of 31,157 SNPs distinguished the three ST258 
strains from the ST42 strain, Kpl832, and 27% of these SNPs 
(8,444 SNPs) were located in the -l.l-Mbp recombination region 
(Fig. 3). 

Since the two sequenced reference strains (NJST258_1 and 
NJST258_2) were genotyped as ST258 clade II strains, we created 
a de novo genome sequence of the prototypic ST258 clade I strain 
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ST258-II 
ST258-I 
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FIG 2 Upstream and downstream junction SNPs for the ~1.1-Mbp recombination fragment and cps region in ST258, ST442, and ST42 strains. The start site of 
the replacement of the ~52-kb cps-harboring region is the same as that of the ~215-kbRDin ST258 II clades (7). Sequences were obtained from Kpl878 (an ST258 
clade I strain), NIST258_1 (an ST258 clade II strain), IM45 (an ST11 strain), Kpl832 (an ST42 strain), and Kpl3 (an ST442 strain). 



(Kpl787) to use as a clade I reference genome. An ~450-kb region 
of the Kpl787 genome, which contains the entire ~215-kb RD, 
was used as a reference to examine recombination events between 
Kpl832 (ST42) and Kpl787 (an ST258 clade I strain). The com- 
parison revealed a nearly identical region (2 SNPs) spanning 
-52 kb that contains the cps region; the alignment of the two 
regions maps to the start of this DNA replacement at the same 
location in the RD region (Fig. 2). In addition, the ICEKp258.2 
element is absent from strain Kpl832, and there is nucleotide 
divergence outside the aforementioned ~52-kb region (Fig. 1 and 
2). 

Comparative sequence analysis further showed that the neigh- 
boring sequences upstream and downstream from this ~52-kb 
region are identical in ST258 clade I and II strains (Fig. 2). In 
addition, the SNP distribution among the 85 ST258 genomes re- 
ported in our previous study revealed that 592 (89%) of the 664 
SNPs in the RD are located within the ~52-kb cps-harboring re- 
gion (7). Together, the genomic findings are consistent with the 
hypothesis that clade I evolved rapidly through the acquisition of 
the cps region from an ST42 strain. The evidence provided above 
strongly suggests that replacement of the original (presumably 
ST258 clade II) cps region in clade I contributes largely to the 
noted phylogenetic difference between the two ST258 clades. 

2>Za KPC -harboring genetic element. We and others have re- 
ported that the bla KPC gene in ST258 strains is carried exclusively 
by a Tn3-based transposon, Tn4401 (5, 7, 14). Tn440i is 10 kb in 
length, delimited by two 39-bp imperfect inverted repeat (IR) se- 
quences, and harbors the bla^Q gene, a Tn3 transposase gene 
(tnpA), a Tn3 resolvase gene (tnpR), and two insertion sequences, 
lSKpn6 and lSKpn7 (15) (see Fig. S2 in the supplemental mate- 
rial). In contrast, ST 11 and ST442 strains harbor bla KPC - 
containing elements that are distinct from those in ST258 strains 
(see Fig. S2) and share only ~2 kb of sequence with Tn440i. Col- 



lectively, these findings suggest ST258 strains are hybrid strains 
that arose from an ancestral ST 11 strain that acquired an -1.1- 
Mbp contiguous chromosomal segment from an ST442-like 
strain by DNA recombination/replacement. The identification of 
distinct Wa KPC -harboring elements in ST258, ST11, and ST442 
strains indicates bla KPC was acquired by ST258 strains via horizon- 
tal gene transfer (rather than by vertical gene transmission from 
ST11 or ST442 parental strains) after the recombination events. 

DISCUSSION 

The current rise of KPC-producing K. pneumoniae infections in 
U.S. health care facilities has been overwhelmingly associated with 
strains typed as ST258. To better understand the evolutionary 
history of this epidemic clone, we compared the genome se- 
quences of ST258 strains, single-locus variant ST11 strains, and 
other selected K. pneumoniae strain types. Notably, we discovered 
that ST258 strains are hybrid strains comprised of genomic DNA 
from ST11 (-80%) and ST442 (~20%)-like strains— presumably 
the product of a large chromosomal replacement event. 

Recombination events and replacement of large chromosomal 
regions have been documented in various bacteria, and there are 
reported examples where the hybrid strains are associated with 
epidemiological success. Robinson and Enright were the first to 
report a naturally occurring bacterial hybrid — in this case, in a 
Staphylococcus aureus strain known as ST239 (16). This hybrid 
strain is a pandemic methicillin-resistant S. aureus (MRSA) strain 
responsible for -90% of the nosocomial infections throughout 
mainland Asia and much of South America (17). ST239 is com- 
prised of large chromosomal regions from two distantly related 
lineages, ST8 and ST30. Approximately 20% of the ST8 genome 
was replaced with an ~550-kb contiguous chromosomal fragment 
from an ST30 donor strain, thereby creating ST239. This appar- 
ently rare molecular event has not been explained or reproduced 
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FIG 3 (A) SNP matrix for different K. pneumonaie strains. The matrix is illustrated as (total no. of SNPs/no. of SNPs in the ~1.1-Mbp recombination region). 
Green shading indicates the number of SNPs in ST258 strains compared to that in ST1 1 strains. Orange shading indicates the number of SNPs in ST258 strains 
compared to that in ST442 strains. (B) Phylogenetic analysis of the eight isolates based upon 52,135 concatenated SNPs in the core genome. 



in the laboratory. In group B Streptococcus (GBS), large single 
chromosomal replacement events and multiple localized recom- 
bination events occur naturally and can be reproduced in the lab- 
oratory (18). For GBS, conjugation is the molecular pathway for 
genomic movement (19). It is worth noting that genetic replace- 
ment of the GBS cps region between unrelated sequence types is 
the common mechanism by which this species alters its surface 
antigen composition (18). Similarly, cps region replacement- 
associated capsular switching has also been suggested as being an 
intrinsic feature throughout the evolutionary history of Strepto- 
coccus pneumoniae (20). 

Here we discovered that ST258 clade II strains are hybrid strains in 
which 20% of the K. pneumoniae ST1 1 genome was replaced with a 



homologous -1.1 -Mbp contiguous region from a strain in the ST442 
lineage. This region, which includes the previously described region 
of difference (RD) and capsular polysaccharide biosynthetic genes, 
has molecular scars of multiple localized recombination events, sim- 
ilar to the phenomenon in Streptococcus agalactiae ( 18, 19). The find- 
ing that ST442 and ST258 have a contiguous chromosomal region in 
common and that the nucleotide boundaries between the ST442 and 
ST258 clade I and II genomes are indistinguishable (Fig. 2) is evidence 
that the recombination event creating an ST11 and ST442 hybrid 
strain likely occurred once, thereby creating the ST258 clade II lineage 
(Fig. 4). 

Based on recent genome-scale studies, there have been numer- 
ous putative chromosomal recombination events involving the 
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FIG 4 Hypothesized evolutionary history in K. pneumoniae ST258 strains. 



region encoding the CPS biosynthetic machinery (7). ST258 
clades I and II have distinct cps regions, and this is also true of the 
three ST11 strains analyzed in this study (Fig. IB), further sup- 
porting the notion that DNA exchange in and around the cps 
regions may be a general mechanism used by K. pneumoniae to 
rapidly diversify and that novel clades arise through cps switching 
between ST258 (clade I or II) and unrelated K. pneumoniae se- 
quence types. 

The presence, absence, and diversity of genetic landmarks 
within the acquired ~1.1-Mbp region provide clues into ST258's 
recent evolutionary origin and the extent of its genomic plasticity. 
ICEKp258.2, which is absent in ST442, is present in both ST258 
clades, where its chromosomal insertion site is conserved, suggest- 
ing that this ICE was acquired after the major genome recombi- 
nation event that gave rise to ST258 (Fig. 4). In support of the 
notion that ICEKp258.2 is a relatively recent acquisition, the G+C 
content of ICEKp258.2 is 37.1%, significantly less than the 
-57.5% G+C content of the entire K. pneumoniae chromosome. 
Taken together, these observations provide evidence that 
ICEKp258.2 is exogenous and was likely acquired once by ST258, 
before the recombination events involving the cps regions in 
clades I and II (Fig. 4). Moreover, the replacement of the ST258 
clade II cps region with that from ST42 (thus creating clade I) 
occurred after the acquisition of ICEKp258.2 (Fig. 4). 

Adler and colleagues investigated the association of the 
ICEKp258.2 with ST258 by testingl60 K. pneumoniae strains with 
diverse sequence types for the presence of pilV, a gene carried on 
ICEKp258.2 (21). They found that pilVwas present only in ST258 
and genetically related strains. Based on sequence analysis, 
ICEKp258.2 harbors a type IV pilus gene cluster and a type III 
restriction-modification system. A type IV pilus could increase the 
uptake and exchange of DNA, such as plasmids, as well as facilitate 
adherence to living and nonliving surfaces — e.g., the human gut 
or the environment (22) — which may in part explain the high 
transmissibility of ST258 strains and the movement of KPC genes. 
Additionally, a type III restriction-modification system could 
serve in "host specificity" regarding the exchange of certain com- 
patible plasmids and other mobile elements (23). Restriction of 
plasmids and specific mobile elements may explain the differences 
observed between ST11 (which lacks ICEKp258.2) and ST258, as 



the former is associated with a broad range of plasmids and car- 
bapenemases (KPC, VIM, IMP, NDM, and OXA-48) (10-12, 24- 
28), whereas ST258 strains predominantly harbor KPC. Taken 
together, the association of ICEKp258.2 with ST258 K. pneu- 
moniae strains raises the possibility that this element may contrib- 
ute to epidemiological success of this sequence type. To investigate 
whether ICEKp258.2 could potentially be an "epidemic clone- 
specific" target, we are currently investigating the impact of alter- 
ing the type IV pilus gene cluster and the type III restriction- 
modification system in this element. 

Taken together, our findings underscore the role of recombi- 
nation in the rapid evolution of clinical strains of K. pneumoniae 
in both creating hybrid clones and in more localized chromo- 
somal replacements that alter antigenic presentation and ulti- 
mately divert the host response. 

MATERIALS AND METHODS 

Sequence information. Data used in comparative analysis were down- 
loaded from the NCBI database (http://www.ncbi.nlm.nih.gov/genome/ 
genomes/815), including complete genome sequences and annotation of 
K. pneumoniae isolates HS11286 (CP003200) (29), IM45 (CP006656), 
ATCC BAA-2146 (CP006659) (30), Kpl3 (CP003999) (8), 
NJST258_1(CP006923) (7), andNFST258_2 (CP006918) (7). Additional 
sequence data were retrieved from our recent study on K. pneumoniae 
ST258 (7). 

Genome sequencing and assembly. Strain Kpl832 was selected from 
one of the seven ST42 K. pneumoniae isolates that carry the same wzy and 
wzi genes as ST258 cps-1 strains (7). Genomic DNA isolation and library 
preparation were performed as described previously (7). The genome was 
sequenced using an Illumina MiSeq platform, which generated 250-bp 
paired-end reads. De novo assembly for Kpl832 and Kpl787 (a represen- 
tative ST258 clade I strain, selected from our previous study [7]) was 
accomplished by using a combination of CLC genomic workbench (v 
7.0.3; CLC Bio, Aarhus, Denmark), Mira (31), and Velvet (32). The best 
assemblies from each method were combined using Geneious Pro soft- 
ware in order to generate the supercontig for the cps-harboring element. 

Comparative genomics analysis. Visualization of circular genome 
comparisons was performed using the BLAST ring image generator 
(BRIG) (33). Prophages were identified by PHAST (34). Insertion se- 
quences were identified using the IS Finder database (http://www 
-is.biotoul.fr). De novo assembled contigs from Kpl832 and Kpl787 were 
ordered and oriented relative to the NIST258_1 genome and then com- 
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bined together as a pseudochromosome using the Mauve contig mover 
(35). Multiple genome sequence alignments and comparison analysis 
were then performed with Mauve (35). For core genomic analysis, SNPs 
located on the MGEs, including prophases, ICEs, and insertion elements, 
as well as those on rRNAs and tRNAs, were excluded. The concatenated 
SNPs were used to generate a consensus phylogenetic tree by the maxi- 
mum likelihood method based on the Tamura-Nei model with the MEGA 
5 software (36). The SNP distribution among different genome sequences 
was inferred from genome alignment using Mauve (35), and SNPs were 
counted on a 1,000-nucleotide (nt) window based on the nucleotide po- 
sition on NJST258_1. 

Nucleotide sequence accession numbers. IUumina short read data for 
ST258 have been deposited in the Sequence Read Archive (SRA) database 
under accession no. SRP036874 (7). The Illumina short read data for 
Kpl832 have been deposited in the SRA database under accession no. 
SRX512850. 

SUPPLEMENTAL MATERIAL 

Supplemental material for this article may be found at http://mbio.asm.org/ 
lookup/suppl/doi:10.1128/mBio.01355-14/-/DCSupplemental. 

Figure SI, EPS file, 25.9 MB. 

Figure S2, EPS file, 1.1 MB. 
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